Adapting Rankers Online

نویسندگان

  • Katja Hofmann
  • Shimon Whiteson
  • Maarten de Rijke
چکیده

At the heart of many effective approaches to the core information retrieval problem— identifying relevant content—lies the following three-fold strategy: obtaining contentbased matches, inferring additional ranking criteria and constraints, and combining all of the above so as to arrive at a single ranking of retrieval units. Over the years, many models have been proposed for content-based matching, with particular attention being paid to estimations of query models and document models. Different task and user scenarios have given rise to the study and use of priors and noncontent-based ranking criteria such as freshness, authoritativeness, and credibility. The issue of search result combinations, whether ranked-based, score-based or both, has been a recurring theme for many years. As retrieval systems become more complex, learning to rank approaches are being developed to automatically tune the parameters for integrating multiple ways of ranking documents. This is the issue on which we will focus in the talk. Search engines are typically tuned offline; they are tuned manually or using machine learning methods to fit a specific search environment. These efforts require substantial human resources and are therefore only economical for relatively large groups of users and search environments. More importantly, they are inherently static and disregard the dynamic nature of search environments, where collections change and users acquire knowledge and adapt their search behaviors. Using online learning to rank approaches, retrieval systems can learn directly from implicit feedback, while they are running. The talk will discuss three issues around online learning to rank: balancing exploitation and exploration, gathering data using one pair of rankers and using it to compare another pair of rankers, and the use of rich contextual data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multileaving for Online Evaluation of Rankers

In online learning to rank we are faced with a tradeoff between exploring new, potentially superior rankers, and exploiting our preexisting knowledge of what rankers have performed well in the past. Multileaving methods offer an attractive approach to this problem since they can efficiently use online feedback to simultaneously evaluate a potentially arbitrary number of rankers. In this talk we...

متن کامل

Don’t Hurt Them: Learning to Rank from Historical Interaction Data (Keynote)

One of the main advantages of online evaluation schemes is that they are user-based and, as a result, often assumed to give us more realistic insights into the real system quality than off-line methods. This is also one of their main disadvantages: comparing two rankers online requires presenting users with result lists based on those rankers and observing how users interact with them. New rank...

متن کامل

Multiple Aspect Ranking for Opinion Analysis

We address the problem of analyzing multiple related opinions in a text. For instance, in a restaurant review such opinions may include food, ambience and service. We formulate this task as a multiple aspect ranking problem, where the goal is to produce a set of numerical scores, one for each aspect. We present an algorithm that jointly learns ranking models for individual aspects by modeling t...

متن کامل

Bayesian Aggregation of Order-Based Rank Data

Rank aggregation, that is, combining several ranking functions (called base rankers) to get aggregated, usually stronger rankings of a given set of items, is encountered in many disciplines. Most methods in the literature assume that base rankers of interest are equally reliable. It is very common in practice, however, that some rankers are more informative and reliable than others. It is desir...

متن کامل

Search Engines that Learn from Their Users

More than half the world’s population uses web search engines, resulting in over half a billion queries every single day. For many people, web search engines such as Baidu, Bing, Google, and Yandex are among the first resources they go to when a question arises. Moreover, for many search engines have become the most trusted route to information, more so even than traditional media such as newsp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011